Search CORE

43 research outputs found

Toll-like receptor signaling in vertebrates: Testing the integration of protein, complex, and pathway data in the Protein Ontology framework

Author: Arighi Cecilia
D’Eustachio Peter
Masci Anna Maria
Natale Darren
Ruttenberg Alan
Shamovsky Veronica
Smith Barry
Wu Cathy
Publication venue
Publication date: 01/01/2015
Field of study

The Protein Ontology (PRO) provides terms for and supports annotation of species-specific protein complexes in an ontology framework that relates them both to their components and to species-independent families of complexes. Comprehensive curation of experimentally known forms and annotations thereof is expected to expose discrepancies, differences, and gaps in our knowledge. We have annotated the early events of innate immune signaling mediated by Toll-Like Receptor 3 and 4 complexes in human, mouse, and chicken. The resulting ontology and annotation data set has allowed us to identify species-specific gaps in experimental data and possible functional differences between species, and to employ inferred structural and functional relationships to suggest plausible resolutions of these discrepancies and gaps

PhilPapers

Directory of Open Access Journals

PubMed Central

University of Delaware Library Institutional Repository

The representation of protein complexes in the Protein Ontology

Author: Arighi Cecilia
Blake Judith
Bult Carol
Drabkin Harold
D’Eustachio Peter
Evsikov Alexei
Natale Darren
Roberts Natalia
Ruttenberg Alan
Smith Barry
Wu Cathy
Publication venue
Publication date: 01/01/2011
Field of study

Representing species-specific proteins and protein complexes in ontologies that are both human and machine-readable facilitates the retrieval, analysis, and interpretation of genome-scale data sets. Although existing protin-centric informatics resources provide the biomedical research community with well-curated compendia of protein sequence and structure, these resources lack formal ontological representations of the relationships among the proteins themselves. The Protein Ontology (PRO) Consortium is filling this informatics resource gap by developing ontological representations and relationships among proteins and their variants and modified forms. Because proteins are often functional only as members of stable protein complexes, the PRO Consortium, in collaboration with existing protein and pathway databases, has launched a new initiative to implement logical and consistent representation of protein complexes. We describe here how the PRO Consortium is meeting the challenge of representing species-specific protein complexes, how protein complex representation in PRO supports annotation of protein complexes and comparative biology, and how PRO is being integrated into existing community bioinformatics resources. The PRO resource is accessible at http://pir.georgetown.edu/pro/

PhilPapers

Protein Ontology: A controlled structured network of protein entities

Author: Arighi Cecilia N.
Blake Judith A.
Bult Carol J.
Christie Karen R.
Diehl Alexander D.
Drabkin Harold J.
Julie Cowart
Natale Darren A.
Olivia Helfer
Others
Peter D’Eustachio
Smith Barry
Publication venue
Publication date: 01/01/2013
Field of study

The Protein Ontology (PRO; http://proconsortium.org) formally defines protein entities and explicitly represents their major forms and interrelations. Protein entities represented in PRO corresponding to single amino acid chains are categorized by level of specificity into family, gene, sequence and modification metaclasses, and there is a separate metaclass for protein complexes. All metaclasses also have organism-specific derivatives. PRO complements established sequence databases such as UniProtKB, and interoperates with other biomedical and biological ontologies such as the Gene Ontology (GO). PRO relates to UniProtKB in that PRO’s organism-specific classes of proteins encoded by a specific gene correspond to entities documented in UniProtKB entries. PRO relates to the GO in that PRO’s representations of organism-specific protein complexes are subclasses of the organism-agnostic protein complex terms in the GO Cellular Component Ontology. The past few years have seen growth and changes to the PRO, as well as new points of access to the data and new applications of PRO in immunology and proteomics. Here we describe some of these developments

PhilPapers

Reactome pathway analysis: a high-performance in-memory approach

Author: Antonio Fabregat
Guilherme Viteri
Henning Hermjakob
Konstantinos Sidiropoulos
Lincoln Stein
Oscar Forner
Pablo Marin-Garcia
Peter D’Eustachio
Vicente Arnau
Publication venue: Springer Nature
Publication date: 01/01/2017
Field of study

BACKGROUND: Reactome aims to provide bioinformatics tools for visualisation, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modelling, systems biology and education. Pathway analysis methods have a broad range of applications in physiological and biomedical research; one of the main problems, from the analysis methods performance point of view, is the constantly increasing size of the data samples. RESULTS: Here, we present a new high-performance in-memory implementation of the well-established over-representation analysis method. To achieve the target, the over-representation analysis method is divided in four different steps and, for each of them, specific data structures are used to improve performance and minimise the memory footprint. The first step, finding out whether an identifier in the user’s sample corresponds to an entity in Reactome, is addressed using a radix tree as a lookup table. The second step, modelling the proteins, chemicals, their orthologous in other species and their composition in complexes and sets, is addressed with a graph. The third and fourth steps, that aggregate the results and calculate the statistics, are solved with a double-linked tree. CONCLUSION: Through the use of highly optimised, in-memory data structures and algorithms, Reactome has achieved a stable, high performance pathway analysis service, enabling the analysis of genome-wide datasets within seconds, allowing interactive exploration and analysis of high throughput data. The proposed pathway analysis approach is available in the Reactome production web site either via the AnalysisService for programmatic access or the user submission interface integrated into the PathwayBrowser. Reactome is an open data and open source project and all of its source code, including the one described here, is available in the AnalysisTools repository in the Reactome GitHub (https://github.com/reactome/)

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

PubMed Central

Digital.CSIC

Reactome knowledgebase of human biological pathways and processes

Author: Birney Ewan
Caudy Michael
Croft David
de Bono Bernard
D’Eustachio Peter
Garapati Phani
Gillespie Marc
Gopinath Gopal
Hemish Jill
Hermjakob Henning
Jassal Bijay
Kanapin Alex
Lewis Suzanna
Mahajan Shahana S
Matthews Lisa
May Bruce
Schmidt Esther
Stein Lincoln
Vastrik Imre
Wu Guanming
Publication venue: CUNY Academic Works
Publication date: 03/01/2009
Field of study

Reactome (http://www.reactome.org) is an expert-authored, peer-reviewed knowledgebase of human reactions and pathways that functions as a data mining resource and electronic textbook. Its current release includes 2975 human proteins, 2907 reactions and 4455 literature citations. A new entity-level pathway viewer and improved search and data mining tools facilitate searching and visualizing pathway data and the analysis of user-supplied high-throughput data sets. Reactome has increased its utility to the model organism communities with improved orthology prediction methods allowing pathway inference for 22 species and through collaborations to create manually curated Reactome pathway datasets for species including Arabidopsis, Oryza sativa (rice), Drosophila and Gallus gallus (chicken). Reactome\u27s data content and software can all be freely used and redistributed under open source terms

City University of New York

Cystin, a novel cilia-associated protein, is disrupted in the cpk mouse model of polycystic kidney disease

Author: Beier David R.
D’Eustachio Peter
Guay-Woodford Lisa M.
Hou Xiaoying
Kremmidiotis Gabriel
Lefkowitz Elliot J.
Mrug Michal
Yoder Bradley K.
Publication venue: American Society for Clinical Investigation
Publication date: 01/01/2002
Field of study

The congenital polycystic kidney (cpk) mutation is the most extensively characterized mouse model of polycystic kidney disease (PKD). The renal cystic disease is fully expressed in homozygotes and is strikingly similar to human autosomal recessive PKD (ARPKD), whereas genetic background modulates the penetrance of the corresponding defect in the developing biliary tree. We now describe the positional cloning, mutation analysis, and expression of a novel gene that is disrupted in cpk mice. The cpk gene is expressed primarily in the kidney and liver and encodes a hydrophilic, 145–amino acid protein, which we term cystin. When expressed exogenously in polarized renal epithelial cells, cystin is detected in cilia, and its expression overlaps with polaris, another PKD-related protein. We therefore propose that the single epithelial cilium is important in the functional differentiation of polarized epithelia and that ciliary dysfunction underlies the PKD phenotype in cpk mice

Crossref

PubMed Central

Correction: Toll-Like Receptor Signaling in Vertebrates: Testing the Integration of Protein, Complex, and Pathway Data in the Protein Ontology Framework

Author: Alan Ruttenberg
Anna Maria Masci
Barry Smith
Cathy Wu
Cecilia Arighi
Darren A. Natale
Peter D’Eustachio
Veronica Shamovsky
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref